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Abstract 



Let A and B be two affinely generating sets of ZJj. As usual, we denote their Minkowski sum 



o 



by A + B. How small can A + B be, given the cardinalities of A and Bl We give a tight answer to 
^ this question. Our bound is attained when both A and B are unions of cosets of a certain subgroup 

of Zjj . These cosets are arranged as Hamming balls, the smaller of which has radius 1. 

By similar methods, we re-prove the Freiman-Ruzsa theorem in Z£, with an optimal upper 
bound. Denote by F(K) the maximal spanning constant |(v4)|/|A| over all subsets A C Z£ with 
doubling constant \A + A|/|A| < K. We explicitly calculate F(K), and in particular show that 
A K /4K < F(K) ■ (1 + o(l)j < 4 K /2K. This improves the estimate F(K) = poly{K)A K , found 
! ! recently by Green and Tao [T7] and by Konyagin [25] . 

o 

O 1 Introduction 

Much work has been devoted to the study of Minkowski sums of sets. Questions concerning such 

S sums come up in geometry, and are at the core of additive combinatorics. Research in this area 
has blossomed in recent years, and even Tao and Vu's monograph }36] no longer covers all the most 
recent developments. In this paper we concentrate on the Minkowski sum of two generating sets 

(N ofzr, 1 . 

We first review some of the relevant literature. Let G be an abelian group, and let A and B be 
two finite subsets of G. As usual, we denote 

^ A + B = {a + b\aeA,beB} 



OO 
O 



and we ask about the minimum of |A + B\, given the cardinalities of A and B. 

In general, the answer ranges from max(|A|, \B\) to |A| + \B\ — 1, depending on the structure 
of G. For a torsion-free G, if A and B are arithmetic progressions with the same step, then 
\A + B\ = \A\ + \B\ — 1, which is optimal. Likewise, if G — Z p is cyclic of prime order, then the 
. answer is given by the Cauchy-Davenport theorem, |A+£?| > min(|A| + |i?| — 1, \G\) [US]. Moreover, 

by a theorem of Vosper [38], if |A| + |_B| < \G\ then equality holds only for arithmetic progressions. In 
the other extreme case, G has a finite subgroup of a suitable cardinality. Thus, if H<G is a subgroup 
of cardinality \H\ — max (|A|, \B\), an optimal choice is to have A and B be subsets of H, in which 
case | A + B\ = max (|A|, \B\). More generally, \A + B\ can be as small as max (|A|, \B\) if and only 
if min(|A|, 1^1) < \H\ and \H\ divides max(|A|, \B\) 36, p. 55]. In the general case [5] [TO], the 
smallest possible cardinality of |A + B| is min (|~|A|/|ff|] + — 1) \H\, where the minimum 

is over all finite subgroups H of G. In a sense, this result interpolates between the two extremes. 
In an optimal construction [HUH] the sets A and B are contained in and |~|-B|/|i?|] cosets 

of H, whose arrangement is a lexicographical variant of an arithmetic progression. In particular, 
for G a 2-torsion group this reduces to the well-studied Hopf-Stiefel function [201 [Ml HI O ■ 
Stability is a recurring theme in modern extremal combinatorics. Once an extremal problem is 
solved, it is interesting to explore what happens when we consider candidate solutions that do not 
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resemble the global optimum. The crucial feature of the above-mentioned optimal constructions is 
that A and B are densely packed in cosets of properly chosen subgroups of G. We therefore return 
to the original question, under the requirement that A and B are not allowed to be contained in a 
proper subgroup of G or a coset thereof. The affine span of A, denoted (A), is the smallest coset (of 
any subgroup) containing A. We say that A affinely generates G if (A) — G. Clearly this definition 
coincides with the usual notion of a generating set if € A. The refined problem is as follows: In 
a finitely generated abelian group G, find min \A + B\ as a function of \A\ and \B\, where A and B 
are finite affinely generating subsets of G. 

Naturally, the structural properties of G play a role in this problem as well. For the torsion-free 
case, G = Z d , this question and similar ones were discussed by Ruzsa [30] . and a full answer was 
finally given by Gardner and Gronchi [15j . In the extremal construction, the smaller set is a simplex 
of d + 1 points, on one of whose edges lies an arithmetic progression, and the other set is roughly 
the sum of several copies of it. As discussed there, this is analogous to the Brunn-Minkowski 
theorem [55] . 

Here we present the following lower bound for the opposite extreme of a 2-torsion group, G = Z£ . 

Theorem 1. Suppose A, B C G = 1% such that (A) =G, B ^ and \ A\ < §|G|. 
// 1 is the largest positive integer such that 



Ml < *-±±-\G\ 



and < k < t and w G [—1,1] are such that 



\ B \ = (o)+(i)+-+a)+MV) lcl 
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then 

\A + B\ > vu/ ' yi/ ' VB ' ' yK+1/ ' ^±±1-|G| 

This bound is tight when w = 0, and it is attained by the sets 

A = D\x Z£ _ * B = Dl x Z™"* A + B = D{ +1 x Z 

where D k = {x G Z| | #{i\xi = 1} < k} is a Hamming ball of radius k in7L\. 

The Freiman-Ruzsa theorem |31j is a major result in additive combinatorics. In the context 
of the above discussion, it addresses the special case A = B. It states that if A is a subset of an 
r- torsion abelian group with \A + A\ < K\A\, then A is contained in a coset of cardinality at most 
F(K)\A\, with F(K) = K 2 r K . The special case r = 2 has received considerable attention [3 
[TBI [T71 IT{?1 12"31 12HI I3"2l [3T] . Among the most recent contributions is work by Green and Tao [T7] with 
further improvement by Konyagin [23] • It shows that one can take F(K) = 2 2K+ °( logK \ Here we 
exactly determine the lowest possible value of F(K) for r — 2. 

Theorem 2. For K > 1, denote by t > 1 the unique integer for which: 

Q±t±± <K< Cj 1 l±i*+il± 1 
t + i - (t + 1) + 1 

ForACl% such that \A + A\/\A\ <K, we have \(A)\/\A\ < F(K) where: 

I t 2 +t+l 2i — A ^ (t + l) + l 

T/izs choice of F{K) is tight, and grows as Q(2 2K / K) . 

Compression is an important tool from extremal set theory. Much progress in the application of 
compression to additive problems was made by Bollobas and Leader in [TJ , and it is a key ingredient 
in Green and Tao's proof in [17] ■ There is a whole range of compression operators C, that transform 
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an arbitrary set A to another set C(A), with \C(A)\ = \A\ and \C(A) + C(A)\ < \A + A\. By a 
finite sequence of such compressions, it is possible to reduce to the case where A is compressed in 
some appropriate sense, and hence has certain structural properties, which make A + A easier to 
study. The difficulty is that C(A) need not be affinely generating even if A is. Green and Tao 
handled this difficulty by restricting the types of compression operators they used. Our approach 
is different. We employ more types of compression operators and we proceed as long as possible 
without jeopardizing affine generation, i.e., as long as (C(A)) = (A). 

Isoperimetric inequalities play an important role in our work. In our investigations of A + B, 
we prove a new variant of the isoperimetric inequality for the hypercube. 

Overview: In Section [2] we discuss compressions and other useful tools. We explore the key 
notion of compression that maintains affine generation. In Section[3] Theorem [2] is proved, first in 
an asymptotic form, then with the exact expression. In Section [4jwe establish Theorem [I] The 
proof utilizes our new isoperimetric inequality. 



2 Tools 

In this section we briefly survey several concepts and results that are used below. These include 
the lexicographic order and the Hopf-Stiefel function. Then we discuss compressions in Zjj, in line 
with Section 2 of |17j . and we introduce the study of compressions that preserve affine generation. 

2.1 The Lexicographic Order 

Throughout, we use the linear basis {e\, ea, e n } for Z£. Elements i g ZJ are expressed as 
x = 'YHi=\ x i e i- The correspondence between vectors x £ Z 2 an d their supports {j | Xj = 1} C 
{1, ...,n} = [n], is used to simplify certain notation and arguments. 

The lexicographic order is a total order on ZJ . For ije Zj we say that x -< y, if Xi < yi for 
the largest coordinate i for which Xi ^ yi- For example, the ordering of Zj is: 

-< ei -< e 2 -< ei + e 2 -< e 3 -< e% + e 3 -< e 2 + e 3 -< e% + e 2 + e 3 

The height, h{x) of an element a; in a finite totally ordered set is x's place in that order. For a 
set of elements A we denote h(A) — J2xgA h{ x )- 

If T C Z2, then its initial segment of size a, denoted IS(a,T), is the set of the a smallest 
elements of T in the lexicographic order. We use the abbreviation IS (a) = IS{a,1^) for n € N 
large enough. 

2.2 The Hopf-Stiefel Function 

For the reader's convenience we prove the following observation of Bollobas and Leader pQ . 

Proposition 3. For two initial segments IS(a),IS(b) C Z£, the sum IS (a) + IS(b) is an initial 
segment as well. 

Proof. For z -< x + y, we claim that z — x' + y' for some x' < x and y' < y. Let i g N be largest 
index such that Xi = 1 or yi = I. Say xi = 1. If Zj = 0, then clearly z -< x, so we can take 
x' = z and y' = 0. If z, = 1, then note that (z — ej) -< [x — e,) + y. By induction on i, obtain 
(z — e^) = x" + y" for a;" r< (x — ei) and y" ^ y, and choose x' = x" + e, and j/' = y". □ 

The Hopf-Stiefel binary function a o b can be defined onNxN as follows: 

aob= \IS(a)+IS(b)\ 

Proposition [3] can be restated as: IS (a) + IS(b) = IS(a o b). This definition is relevant for us for 
the following reason. The cardinality of a sumset of two sets of given cardinalities is minimized by 
taking the two sets to be initial segments: 



a o b — min < \A + B 



A,Be Z 2 , \A\ = a, \B\ 
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Note that here the sets are not required to be affinely generating. This result can be deduced by 
the technique of compressions as we discuss below. See Lemma [6] 

In particular, taking A = IS (a) and B = IS(b\) U (e n + ISfa)) for n large enough, one can 
verify the sub-distributive law: 

a o (pi + fe 2 ) < a b\ + a o & 2 

Similarly, one can deduce the recursive relations for a, b < 2": 

a o (2" + b) = T + a o b 
(2" + a) o (2" +b) = 2 n+1 

These two formulas can be taken as an alternative definition of the Hopf-Stiefel function [37] . 

The function first arose in works of Hopf [20] and Stiefel [35] . They used tools from algebraic 
topology to prove that aob provides a lower bound for solutions of the Hurwitz problem, concerning 
real quadratic forms (see [31]). The relation to set addition in Z 2 was given by Yuzvinsky [55] , 
As it turns out, the Hopf-Stiefel function arises in the study of several more problems in various 
contexts. There is also a base-p analog of the this function for p > 2, see [7J. For a survey, see [5]. 

2.3 Compressions 

For I = {ii,i 2 ,...} C [n], denote Hi — (0, e^, ei 2 , ...} < Z 2 . As usual, if H is a subgroup of G, 
we denote by G/H the collection of all 7J-cosets in G. The I -compression of a subset A C Z 2 is 
defined by: 

C7/(A)= (J IS(\AnT\,T) 

In words, in every TJf-coset T we replace the elements of AnT by a same-cardinality initial segment, 
with respect to the lexicographic order. We say A is compressed with respect to /, or I -compressed, 
if Ci(A) = A. In particular, lexicographic initial segments of Z 2 are exactly all [n]-compressed sets. 

Example 4. C{i, 2 ,3}({0, ei, e 2 , e 3 , e 4 }) = {0, ei,e 2 ,ei + e 2 ,e 4 }. 

This notion of compression is closely related to the operation bearing the same name from 
extremal set theory (see, e.g., [H]). A subset of Z 2 naturally corresponds to a family, a.k.a. set- 
system, F of subsets of [n]. We freely move between these terminologies if no confusion can occur. 
An {i}-compression corresponds to the push-down operator Ti, which replaces J G J- by J \ {i} 
provided that J\ {{} £ T . \i JF is {z}-compressed for each i, then it is closed under taking subsets 
and is called a downset. The shift operator Sij replaces j by i wherever possible. Namely, for every 
J with i,j J it replaces J U {j} by J U {i} given that the former belongs to T and the latter 
doesn't. We say that T is shift-minimal if it is invariant to all shifts Sij where i < j. One can 
check that being {i, j}-compressed for all i,j G [n] corresponds to being a shift-minimal downset. 

Compression can simplify matters substantially, while preserving several useful features of the 
set-system. Here are some observations about compressions. These and others are found in [17j . 
The proofs are straightforward, working coset by coset. 

Lemma 5 (Properties of compressions). Suppose iCZJ and I C [n]. 

(1) \Ci{A)\ = \A\. 

(2) Ci(A) is I -compressed. 

(3) h(Ci(A)) < h(A) with equality iff A is I -compressed. 

(4) An I -compressed set is J -compressed for all J C I. 

(5) Ci{A) C Ci(B) for all ACB. □ 

Compressions behave well on sumsets. By Proposition [3] one can deduce that the sum of 
two /-compressed subsets is /-compressed too. The following well-known lemma deals with the 
compression of a sum of two general subsets. For the sake of completeness, we prove it here, 
following p] and [P7] . 
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Lemma 6 (Sumset compression). Suppose A,J]C Z§ and I C [n]. Then 

C I (A)+C I (B)CC I (A + B). 
Consequently \Ci{A) + Ci(B)\ <\A + B\. 

Proof. We use a double induction, on |/| and on h(A) +h(B). For the induction step, suppose that 
for some J C I either A or B is not J-compressed. In this case 

d(A) + d(B) = d(Cj(A)) + Ci{Cj{B)) C d{Cj(A) + Cj{B)) C C/(Cj(i4 + B)) = d(A + B) 

Both inclusions are by the induction hypothesis: the first one since h(Cj(A)) + H(Cj(B)) < h(A) + 
h(B) by property (3) of Lemma [5j and the second one since |J| < \I\ and by property (5). The 
equalities are by property (4). 

It only remains to verify the lemma for A and B that are both J-compressed for all J C I. We 
start with the simpler case n — 

What are the subsets of G = that are J-compressed for all J C [n]l By property (4), all 
initial segments are such. If S C G is not an initial segment, then necessarily x ^ S and y € S for 
some consecutive x -< y. The only consecutive pair in G that is not contained in a proper ff/-coset 
is (ei + ... + e„_i) -< e„. One can verify, for example by S being [2...n]-compressed, that the only 
such set is S — i?[ n _i] \ {ei + ... + e„_i} U {e„}. In conclusion, it is enough to check the case where 
A and B are initial segments or equal to S. Now there are four cases to consider: 

1. If both A and B are initial segments, then by Proposition [3] A + B is an initial segment too. 
=> d{A) + d(B) = A + B = d(A + B) 

2. If A = B = S, then note that \S\ < \S + S\ and d(S) = H [n _ 1} . 
=> Cj[S) + Ci(S) = d(S) C d(S + S) 

3. If B = S and A is an initial segment with \A\ < \S\, then A = Ci(A) C ^(S 1 ) = 
4C,(i) + C,(S) = C 7 (S)cC;(A + S) 

4. If B = S and A is an initial segment with |A| > |5| then \A\ + \S\ > \G\. 

This means A + S = G, as the reader may verify by a standard pigeonhole argument. 
C/(A) + CMS) = G = d(G) = C7(A + 5) 

The case n > |7| is implied by the case n = \I\: 

d(A) + d(B)= |J ((aj(A)+Ci(B))nff c ) 

H c eG/ Hi 

= U U ((C7(A)nff a ) + (c 7 (B)n/f 6 )) 

H C £G/H T H a + H b = H c 

= U U (C/^nff^ + dCsniife)) 

H c eG/H I H a + H b = H c 

£ U U C/pni«) + (snH t )) 

H c gG/Hj H a +H h =H c 



^ U Cl ( U ((^nfr„) + (5nF 6 ))] 

H a eG/H, \H a +H b =H c ) 



= |J ^((A + Bjnffc) 

H c eG/ Hi 

= (J (C/fA + Bjn^) 

H c eG/ Hi 

= Ci(A + B). 

The first and second inequalities are simply dividing into cases, according to the involved 
cosets. The third one holds because compressions work coset-wise. Then there is inclusion by the 
assumption on the case I = [n], applied to our Hi and translated to the relevant iJ/-cosets. And 
then, inclusion of initial segments, because the union is at least as large as each of its components. 
The three remaining equalities are similar to the first three. □ 
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2.4 Compressions that Preserve Affine Generation 

As Lemma [6] shows, in the problems we consider here, compressing the sets under consideration can 
only improve our objective function. However, we are restricting ourselves to affinely generating 
sets and compression may destroy this property (e.g., Example [4]). Therefore, our strategy is to 
keep compressing as long as affine generation is maintained. To this end we introduce the following 
definition. 

Suppose that A Z) E, where E — {0, ei,e2, ■■■,€„} is the standard affine basis of If A is 
J-compressed for every I such that Ci(A) D E, we say that A is ((E)) -compressed. Note that by 
part (3) of Lemma [5] every set A containing E, can be turned into an ((E)) -compressed set by a 
finite sequence of such compressions. It turns out that ((E)) -compressed sets are very structured. 

Lemma 7 (Structure of ((-E))-compressed sets). Let iCZj be an ((E)) -compressed set. 

(1) A is a shift-minimal downset. 

(2) A contains a subgroup of maximal size iJoZJJ of the form H = (0, e%, eh). 

(3) A is {1, h,h + i}-compressed for every 1 < i < m — codim H . 

(4) AC H + E, i.e. A = H U A 1 U A 2 U ... U A m where Ai = An (e h+i + H). 

(5) Forl<i<m,Q< \AA < \H\. 

(6) Forl<i<j<m, \Ai\ + \AA < \H\. 

(7) Ifm > 1, then \A\ < (l + f ) \H\. 

Proof. The proofs are fairly straightforward. 

(1) It is a simple observation that both {z}-compressions and {i, j}-compressions preserve E C A. 
Hence A must already be compressed with respect to these sets, i.e., a shift-minimal downset. 

(2) Let h be the maximal dimension of a subgroup contained in A. As shown below in Lemma |8j 
a subgroup of dimension h must contain an element of Hamming weight at least h. By 
shift-minimality e\ + e 2 + ... + e/, G A, and by the downset property H = (0, ei, e^j C A. 

(3) Denote I = {1, h,h + i}. The sets H U {e^ +i } and {e^+j} for j ^ i are initial segments of 
their _ff/-cosets. These sets cover E and remain included in A through the /-compression. 

(4) By the downset property, it is sufficient to show e/ l+ ; + eh+j 4- A f° r each 1 < i < j < m. 
Indeed, if A contains eh+i+eh+j then it contains eh+i+H by being {1, h, ft.+j}-compressed. 
This implies H U (eh+i + H) C A, contrary to the maximality of the subgroup H in A. 

(5) For the lower bound note that eh+i € E C A. On the other hand, if \Ai\ = \H\ then 
H U (eh+i + H ) C A, contrary, again, to the maximality of H . 

(6) Note that e^+j G A, while some lexicographically smaller elements in eh+i + H are not 
contained in A. Therefore A can't be J-compressed for I = {1, h, h + i, h + j}. Since it is 
((-E))-compressed, this means eh+j Ci(A). Equivalently, \A n Hj\ < 2\H\, which leads to 
our claim. 

(7) If |^| < \\H\ for every i, clearly \A\ = \H\+J2Zi \M < i 1 + f ) 1^1- Otherwise, \A\ > \\H\ 
for some i, thus \Aj \ < \H\ — < \ \H\ for every j ^ i. So + \Aj\ < \H\ for some i and 
j, and the remaining ^4j's are no bigger than 

□ 

Lemma 8. Let H be an h-dimensional subgroup ofTL^. Then H contains an element of Hamming 
weight at least h. 

Proof. If h = n, take e\ + e-2 + ... + e n . Otherwise, there exists a basis element such that ej ^ EL. 
In this case, moving from H to Cu\(H) simply deletes ej from the standard basis representations 
of JTs elements, thereby not increasing their Hamming weights. Now note that C{iy(H) is an 
/i-dimensional subgroup of (0, e%, e^-i, e%+\, e n ), and by induction on n contains an element 
of Hamming weight at least h. □ 
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3 The Freiman— Ruzsa Theorem in 

For A C Z£ we refer to | (A) \/\A\ as A's spanning constant and to K = \ A + A\/\A\ as its doubling 
constant. The Freiman-Ruzsa theorem gives an upper bound on the spanning constant in terms 
of K. We first review the theorem and some of its quantitative aspects. Then we calculate the 
bound explicitly, and in particular we determine its correct asymptotics which turns out to be 
<d(2 2K I K). We present the proof in two stages, starting with the asymptotic estimates. We find 
this presentation convenient, since the proof of the asymptotic bound already contains our main 
ideas. 



3.1 Brief Review of the Freiman— Ruzsa Theorem 

Freiman's celebrated theorem [14] states that if A C Z is a finite subset with | A + A\ < K\A\, then 
A is included in a generalized arithmetic progression, whose size (relative to \A\) and dimension 
are bounded. The bounds depend only on K and not on \A\. Ruzsa [55] has made crucial 
contributions to this area. More recently much work was done on similar problems where Z is 
replaced by other groups. In particular Ruzsa [31] proved the analogous result for abelian torsion 
groups. See [37] for a nice exposition. 

Theorem 9 (Ruzsa). Let G be an abelian group in which every element has order at most r. If A 
is a finite subset of G with \ A + A\ < K\A\, then A is contained in a coset of a subgroup H <\G of 
size \H\ < f(r,K)\A\, where 

f(r,K) < K 2 r K \ 

Better estimates on /(r, K) were subsequently found. We denote by F(r, K) the smallest bound 
for which this statement holds. Note that F(r, K) is non-decreasing in K and F(r, 1) = 1. 

By considering the case where A is an affine basis of Z^^ -1 "* we see that F(r, K) > r 2K -°( l °gK) 



(see Example 11 below). This suggests the following conjecture 



Conjecture 10 (Ruzsa). For some C > 2 we have F(r,K) < r CK . 

In an attempt to understand the role of torsion in these phenomena, much work was dedicated 
to the special case r = 2, where G = Z^- This work is also motivated by the role that Z£ plays in 
discrete mathematics and in particular in coding theory [3]. We introduce the following notation: 



F(K) =F(2,K) =sup 



f \{A)\ 
\ \A\ 



A C Z2, n e N, 1 , , , 1 < K 



\A + A\ 



As already observed by Ruzsa [S], for r — 2 his method gives somewhat more, namely F{K) < 
K2 <lK ^ _1 . Later work by Green and Ruzsa [16] gave F(r,K) < K 2 r 2K " 2 , which was again 
refined for r = 2 to F(K) < 2°( K ' logK ) by Sanders [35]. Using compressions, Green and 
Tao T71 were able to prove F(K) < 2 2K +°( vl?lo s K ) . Note that this confirms Conjecture 
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for 

r = 2. The best bound so far is due to Konyagin [53] who further improved this method to derive 
F{K) < 2 2K +°( lo & K \ 

The range of small K has received some attention as well. In the sub-critical range K < 2, the 
exact value of F(K) is known to be F(K) = K for 1 < K < 7/4 and F(K) = f K for 7/4 < K < 2. 
See [H HTl [HI (261 ST] . For K < 12/5 we have F(K) < (2K-l)/(3K-K 2 -l) and for 12/5 < K < 4, 
a recursive formula is available. See [5]. 

The following simple construction [3T] provides a lower bound on F(K). 

Example 11 (Independent Points). Consider the subset: 

A [t] ={0, ei ,e 2 ,...,e t } CZ\ 

Here, for t g N we have F ( ^rpj- ^ > jjp^, and by monotonicity one can obtain: 

F(K)>^2 2K (l-o(l)) 
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3.2 Asymptotics of F(K) 

We first prove a new upper bound, which coincides with the construction in Example |11| for t € N. 
Theorem 12. F f ^4tT~~ ) - tTT /<"" 2 < t e M. Consequently, 

F{K)<^ K (l-o(l)). 

The exponential term 2 2if is as in [T71 [23] , but the polynomial coefficient 1/ is new. Thus 
it re-proves Conjecture [10] for r = 2 with C = 2. This bound and Example [TT] determine the 
asymptotics of F(K) up to a factor of 2. In the next section we calculate F{K) exactly, and show 
that the gap is unavoidable and results from the oscillations in F{K). 

Proof. For an affinely generating subset A C G — 1^ , it is sufficient to prove: 

\A\ = ^\G\ =► \A + A\> ®+* + 1 |G| (1) 

where 2 < t e R. Since both expressions are monotone in t, the theorem follows. 

As in [T7], the main tool is reduction to compressed sets of some sort. First, since (A) — G we 
can assume that A contains an affinc basis for G. But \ A\, \ A+A\ arc not affected by invcrtiblc affinc 
transformations, so we may assume without loss of generality E C A, where E — {0, e\, ea, e„} is 
the standard affine basis of G. Now we assume without loss of generality that A is ((-E))-compressed. 
Indeed, supposing ([!]) holds for ((_E))-compressed subsets, we proceed to general subsets inducting 
on h(A). Let I C [n] be a set such that E C C/(A) ^ A. By Lemma[6j \Ci(A) + C r (A)\ <\A + A\ 
while | Cj (A) | = so A satisfies JT]) provided that Ci(A) does. The inductive argument applies, 
since H(A) > ?i(Cj(A)) by Lemma [g3). 

We continue the proof using the structure of ((E)) -compressed sets. As in Lemma [7] let H C A 
be a maximal subgroup, h = dim , to = codim H and Ai — A(~] (e^+ j + -ff) for 1 < i < to. By 
Lemma [TpO, |A| < (l+m/2)|if|, and an upper bound on m is given by 

1+1 > \A\ 



\G\ 

where the case m = 1 follows from the assumption 2 < i. 

Given m, Lemma [7^4) gives a decomposition of A into to + 1 parts, and we use it to show that 
A + A is at least ~ to/2 times larger than A. This is shown by the following calculation, where all 
indices go from 1 to to and all unions are disjoint: 

A = H U [jA, 

i 

A + A = H U \J(Ai + H) U \J(Ai + Aj) 

i i<j 

El^ + ^l > EniaxdAU^I) > ^ 1^1 + 1^1 = = ^(lAI-lffl) 

* ,, ^ ,„, i ,,, m — 1,,., 1 7-7- j \ to + 3 I CI m — 1, ., 
=► |A + A] > |JJ|+m|if| + ^— (\A\-\H\) = _-_.L-± + _-_|A| 

The right-hand side is decreasing in m in the real interval where ((m + 3) log 2 — l)/2™ 1 > |A|/|G|. 
This interval includes the range of our interest, which is (m/2 + l)/2 m > |A|/|G| = (t + l)/2 ( , or 
equivalently m < t — 1. Thus, we obtain a lower bound on \A + A| by evaluating this expression at 
t — 1, namely: 

□ 
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3.3 Exact Calculation of F(K) 

Theorem [2j which we will shortly prove, provides an explicit formula of F(K). This enables one to 
rederive the asymptotics of F(K), and to deduce the following corollary. 

Corollary 13. Both bounds in the asymptotic inequalities 

±2 2K (l-o(l)) < F(K) < ±2™ (1 -o(l)) 



are sharp up to the o(l) terms. 

It also settles the following conjecture of Diao [5J. 
Corollary 14. F{K) is a piecewise linear function. 



□ 



□ 
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Figure 1: An illustration of F(K) 

In order to calculate F(K), it is useful to consider a related function K(F), which is defined for 
rational numbers of the form F = 2 a /b > 1. 



K(F) = inf ■ 



\A + A\ 
\A\ 



A c Z™, neN, 



1(^)1 



F 



That is, the minimal doubling constant of an afHn ely g enerating set of relative size exactly 1/F. By 



definition F(K) = sup{F | K{F) < K}. Theorem 12 asserts K (2*/(t + 1)) >((*)+ i + l) /(* + 1) 
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for real t > 2, and by Example [XT] it is an equality for t £ N. In order to analyze K(F), we refine 
the arguments in the proof of Theorem [l2j and elaborate on the construction in Example [TT] This 
yields a better view of the structure of sets with a small doubling constant. We begin by describing 
the extended example. 

Example 15. For non-negative integers s,t such that s <t, consider the subset: 
A [t, s ] = {0, e ,ei,e 2 , e t ,e + e x ,e Q + e 2 , ...,e + e f _ s } C Z 2 +1 
It is not hard to verify that 

\A [ttS] \ = 2(t + l)-s, \A M + A M \ = 2 (Q) + * + x ) - (2) ' \( A lt,-])\ = 2 ' +1 - 
Therefore: 

7 >f 2* \ (*)+*+!- @/2 
+ t + 1 - s/2 

This example provides an upper bound on K(F) for a discrete sequence of values. When s = 
However, K(F) is not necessarily monotone, so we cannot imitate the 
and extend the upper bound to general F. Still, the following argument 
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it reduces to Example 
conclusion of Example 
does the work. 

Lemma 16 (Sublinearity of K(F)). If Fx < F2 are in K's domain, then Ki "p^ > K< p?f > . 

Proof. Let F2 — 2 a /b for some a, b £ N. Suppose A\ C Z 2 is an affinely generating set of size 
I Ai| = 2 n /F 1 . Let m £ N be large enough such that a < n + m < &2" +m ~ a . Consider A\ = 
A x x Z 2 n , and note that A[ affinely generates Z 2 l+m and \A[\ = 2 n+m /F 1 . Since Fx < F 2 one 
can take a subset A 2 C j4' x of cardinality |A 2 | = 62" +m_a = 2 n+m /F 2 . Moreover, by m's choice 
n + m + 1 < |A 2 |, so a subset A 2 which affinely generates Z 2 +m can be chosen. Now from 
A 2 + A 2 c Ai + Ai = (Ax + Ax) x ZJ 1 , 

|Ai + Ai| 1 _ |Ai + Ax\ _ \A[ + A'x\ > |A 2 + A 2 | = |A 2 + A 2 | 1 > K(F 2 ) 



I Ai| Fx 2 n 2 n + m ~ 2 n + m \A 2 \ F 2 ~ F 2 

The task is accomplished by taking the infimum over Ai. □ 
Corollary 17 (Superlinearity of F{K)). < for every \<K l <K 2 . □ 



Example 15 and Lemma 16 supply an upper bound on K(F). The following lemma essentially 



claims that this bound is sharp. 

Lemma 18 (Formula for K(F)). Let F > 1 be of the form 2 a /b where a,b £ N, and let s < t be 
the unique pair of non-negative integers for which 

2* - 2* 

t + 1- s/2 ~ < t + 1 - (s + l)/2 

Then 

If (F) = ^ ^ 2 -^- ■ F 

Since the function F(K) is basically the inverse of K(F), Theorem § is a direct consequence 
of Lemma 18 Indeed, Figure [l] is obtained by transposing the graph in Figure [2j and taking the 
maximum wherever the result is multivalued. We omit further details. 

One can notice that K(F) has a more complex structure than F{K). Since Theorem [2] employs 
the information in Lemma 18 only partially, there may be a quicker way of calculating F(K). 
Nevertheless, we feel that the detailed description of K(F) is interesting in its own right, and may 
shed light on the non-trivial form of F(K). 

The proof of Lemma [18] pursues the analysis in Theorem |12| s proof, involving more reduction 
steps which preserve |A| without increasing |A + A|. Through these reductions the structure of A 
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Figure 2: An illustration of K(F) 



becomes similar to Example [l5j so that its doubling constant can be calculated explicitly. We start 
with two reductions which can be formulated separately in terms of integer partitions. All of the 
following will be motivated and applied later, in the proof of the lemma. 

A non- increasing sequence of positive integers ai...a m is an integer partition of a = J^- ai into 
m parts, and for short an m-partition of a. Recall the Hopf-Stiefel function a o b from Section [2] 
We are interested in the minimum of J2i<i<j< m a i ° a j over au " ^-partitions of a. 

A partition ai...a m of a is called compressed if a. L + aj > 2 k =>■ a. L > 2 k for each k and i < j. 



It will be implicit in the proof of Lemma 18 that at least one of the partitions that minimize 
Sj<j a i a j i s compressed. Here we shall restrict the discussion to compressed partitions. 

A partition is called quasi-dyadic if ai...a m _i are powers of 2. No requirement is made on a m . 
Note that a quasi-dyadic partition is always compressed. Our first reduction basically asserts that 
the minimum of X)i<j a i ° a j ^ s attained by a quasi-dyadic partition. 

Lemma 19. A compressed m-partition of a that minimizes Ylj^-j &i o aj is quasi-dyadic. 



Proof. Otherwise, consider the smallest 1 < i < m for which ai is not a power of 2, say 2 k < 
a.i < 2 fe+1 . Since the partition is compressed, a, + a i+1 < 2 k+1 . We 'transfer mass' from a, to 
a^+i. Replace a, by a\ = 2 k , and replace a,+i by a' i+1 = aj+i + a t — 2 k . Note that a\ > a' i+1 and 
monotonicity is preserved. 

How does this move affect 53j<j a i ° a f- By the choice of i, for j < i we have aj = 2 l , where 
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I > k as a partition is non-increasing. Thus the terms involving dj are unchanged: 

dj o a, + dj o a i+ i = 2 l + 2 l = dj o a • + Oj o a- +1 . 

For j > i + 1 we know < a, + i < 2 fe . By the recursive definition of the Hopf-Stiefel function, and 
the sub-distributive law: 

dj o di + dj o Oj+i = 2 k + dj o [di — 2 k ) + dj o dj+i 

> 2 fe + Oj- o (aj - 2 fc + o i+1 ) = dj o a ■ + aj o a- +1 . 

Finally, again by the recursive definition the mixed term becomes strictly smaller: 

di o a i+ i = 2 k + (di - 2 k ) o a i+ i > 2 k = a- o a- +1 

The combination of the last three calculations yields that the sum 2~^<j a i°a-j can be made smaller 
by changing the partition, in contradiction to the minimality assumption. □ 

Since 2 fc o a = 2 fe for a < 2 k , in the quasi-dyadic case the summation can be simplified : 

m 

^ diOdj = ^ max(di,dj) = ^ di = ^(ra — i) ■ dj 

l<i<j<m \<i<j<m l<i<j<m i—1 

It is natural to conjecture that the minimum is obtained when d\...d m arc 'almost' equal. A 
quasi-dyadic m-partition of a is quasi-fair if for some k € N, a, G {2 fc , 2 fe_1 } for each 1 < i < m— 1. 
For example 4 + 4 + 2 + 2 + 2 + 1 and 4 + 4 + 4 + 3 and 8 + 4 + 3 are some quasi-fair partitions of 
15. The following properties of quasi-fair partitions are easily verified: 

1. In the above definition one can choose 

k = |log 2 (a/m)l , 
and then exactly d\...dj exceed 2 fe_1 where 

j = [a/2 fe - 1 ] - m. 

2. For every two positive integers m < a, there exists a unique quasi-fair m-partition of a. 

3. If d\...d m and ^...a^ are the quasi-fair m-partitions of a < a', then di < a- for all i. 

4. A sub-partition (in the sense of a sub-sequence) of a quasi-fair partition is quasi-fair. 
Now we are ready to state the second reduction. 

Lemma 20. The minimum of^2 iK j di o dj over all quasi- dyadic m-partitions of a is obtained only 
by the quasi-fair one. 

Proof. This lemma can be verified by induction on m. For a partition that minimizes the sum, it 
is enough to show d\ = 2 k for k = [log 2 (<x/m)] . By the induction hypothesis d2...d m are quasi-fair, 
and thus constitute the unique quasi-fair sub-partition we are looking for. By the monotonicity 
property applied on d2...d m , for a competing sequence a! x ...a! m with d\ > oi, necessarily a[ < di for 
i > 2, and consequently: 

m m / m m \ 

y^(m -i)-di = - i) ■ di + (m - 1) a ■ - a» 

i=l i=l \i=l i=l / 



1=1 



□ 



With these reductions in hand, we can complete the calculation of K(F). 
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Proof, (of Lemma 18 1 Let (A) = G = Z£. Lemma 18 is proved by showing the following lower 
bound on \ A + A\, which is reached by Example 15 and Lemma [16| 



t+1 



- (j + l)/2 |A| t + 1 -a/2 
2* |G| " 2* 



|+ + A| > Q+i+l-i© 



\G\ 



2< 



(2) 



If |A| > s|G|, then by the pigeonhole principle A + A = G, as required in the cases t = 1, 2. Hence 
we may assume |A| < ||G| and i > 3. 

We start as in Theorem 12 We first assume without loss of generality that A is ((E))- 
compressed, and therefore by Lemma [7] has the following properties: 

• There exists a subgroup H = (0, ei, ej,) such that A = ff U Ai U A 2 U ... U A m , where 
Ai = A n (e/j+i + ff) and m — codim H. 

• l/2 m < \A\/\G\ < (1 + f )/2 m . By the assumptions 0^ < \A\/\G\ < §, we can write 
1 < m < t. 

• Each is a lexicographic initial segment of ej + ff . Therefore yl is uniquely determined by 
the sequence Oi, a m where a« = Note that < at < 2 . 

• By shift-minimality ai > > ••■ > a m - In other words, ax,...,a m is a partition of a = 
|A|-|G|/2 m . 

As in Theorem [12] we use these properties to write A + A as a disjoint union of its intersections 
with £f-cosets, which are of three forms: H, H + Ai and Ai + Aj. Since the Ai's are initial segments 
of their cosets, the sumsets of the third form can be expressed via the Hopf-Stiefel function: 



\A + A\ = \H\+m\H\ 



E 

l<i<j<m 



\Ai + A 3 



1 



\G\ 



l<i<j<m 



This equation makes it interesting to find partitions a\...a m of a, that minimize X)i<j a « a i- 

We next show that the partition ai...a m is compressed. For i < j and 1 < k < h we exclude 
the case where a, < 2 fc < a, + flj by the assumption that ^4 is already ((£'))-compressed. For 
/ = {1, 2, k,h + i,h + j}, let's examine the set Ci(A). Ai is replaced by an initial segment of 
&h+i + H of size 2 fc , and Aj is replaced by an initial segment of eh+j + H of size a, + — 2 fc , which 
is not empty by assumption. In other words, becomes 2 fe , and £ C A is preserved. 

By Lemmas 19|[20 if \A + A\ is minimal then a\...a m is the quasi-fair quasi-dyadic m-partition 
of a = \A\ - \G\/2 m . In this situation 



A + A\-^-\G\ = Y / (™-i)-2 k 



(m-i)- 2 11 - 1 = 



i=3+l 



for < j < m and < fc < (dim G — m) such that 



m + j — 1 



2 fc < |A| 



|G| 



< 



m + j 



m-j 
2 



Remark. Note that in the cases j 



1 or j = m we can choose j' = and fc' 



We could avoid this freedom of choice by not permitting j 
resulting \A + A\, we allow both ways. 



+ 1 as well. 

0, but since it does not affect the 



All that remains now is to show that, as in Theorem 12 to minimize \A + A\ we should make 



m as large as possible, i.e., m = t — 1. The proof is by induction on t — m: 
• Suppose m = t — 1. We check that ([2| holds. 



2m- {8 + 1) \G\ 



< \A\ 



|G| 
2 m 



< 



2m 



|G| 



2 2 m +i 2 m — 2 2 m +i 

Denote fc = dimG — m — 1 and j = m — s, and observe that < j < m. Then the above 
expression for the minimal \A + A\ becomes 



\A + A\ = 



1 



|G| 



1 (m 

2 ' 



(m 
2 



2m+l 



t+1 



K2) 



■IGI 
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Suppose m < t — 1. The above discussion yields a compressed set A, such that a\...a m is the 
quasi-fair quasi-dyadic partition of \A\ — \H\, and \ A + A\ is minimal given to, and equals: 

, . ., TO + 1 ^ 

l^ + ^l = ^H G l + 2-, a *°°i 

l<i<_7<m 

We show that increasing to makes |j4 + A| smaller. Denote by A', H' and a' 1 ...a' m + 1 the 
corresponding set, subgroup and partition for ml = to + 1. Similarly: 

l^ + ^l = UrVl+ E 4° 4 

l<i<j'<m+l 

Now define a = \H'\ = \H\/2 = \G\/2 m+1 . Since a\...a m is a quasi-dyadic m-partition for 
to > 1 and ai < \H\, necessarily ao > a< and a^o ai = clq for all 1 < i < to. Hence for the 
quasi-dyadic (to -I- l)-partition ao...a m : 

\a a, f m + l to \ \ - to + 2 \ - 

^ ' l<i<J<m ~ 0<i<j<m 



But by Lemma 20 the partition a[...a' rn+1 gives the minimal value for this expression. More- 
over, since ao = > a\ these partitions differ and \A' + A'\ < \ A + A\. 

□ 

Remark. An examination of the proof reveals two kinds of reduction steps. Either A is compressed 
without changing (A), or we find a set A' where \A'\ = \A\ and \A' + A'\ is substantially smaller 
than \A + A\. Hence, the proof actually provides a characterization of the extremal case, up to 
compressions that preserve (A) and + 



4 Addition of two different sets 

What is the smallest possible cardinality of A + B if A, B C G = Z£ are two affinely spanning 
subsets of given cardinalities? In this section we prove Theorem [TJ which gives an essentially 
complete answer. In addition we establish a new isoperimctric inequality, which is used in the 
proof. But first, we make some remarks concerning the theorem. 

Remarks (on Theorem [T]). 

1. Tightness: Consider (|A|/|G|, \B\/\G\, \A + B\/\G\) as a point in [0, l] 3 . The Hamming balls 
construction shows that the bound goes through the points of the form: 

/ 1 + * l + t + ... + Q l + t + + (4J 
^ 2* ' 2* ' 2* 

An inspection of Figure [3] shows that all points properly inside their convex hull are strictly 
below the bound, and hence cannot be realized by such sets. In other words, further improve- 
ments of the bound will be local in nature. 

2. The formulation of the theorem apparently breaks the symmetry and doesn't require (B) = G. 
Still, there is an asymmetry in the result as well, and the theorem is of interest mostly when 
|A| < \B\. See also the remark after the proof. 

3. In order to simplify the statement of the theorem, t is defined as the largest positive integer 
such that < (t + l)/2* . However, the only assumption on t which the proof actually 
uses is: 

t + 2 \A\ 
2*+! < \G\ 

The theorem can, therefore, be applied as well with t larger than in the given formulation. 
As Figure [3] shows, the resulting bound would be weaker, but may still be useful in certain 
contexts. 



< k < t 
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|A| / |G| = 6/32 
optimal construction 
|A| / |G| = 7/64 
|A| / |G| = 8/128 
|A| / |G| = 9/256 
|A| / |G| = 10/512 
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Figure 3: An illustration of the lower bound 

Theorem [l] implies that a large enough number of large enough affinely generating sets must 
add up to the whole group: 

Corollary 21. Suppose that (Ai) = (A 2 ) = ... = (A rn ) = G = Z% with \Ai\/\G\ > (m + 2)/2 m+1 
for all i. Then A 1 + A 2 + ... + A. m = G. 

Proof. We repeatedly apply the theorem with A = Ai and B = A\ + ... + Ai-i for all 1 < i < m 
to conclude 

[Ai + A 2 + ... + Aj\ m + ('"+ 1 ) + ... + P+ 1 ) 
\G\ 2 m +! 

Indeed, in view of remark 3 above and the assumption on the cardinalities, we may choose t = m+1, 
and then k — i — 1 and w > by the induction hypothesis. Since |Ai+A 2 + ... + A m _i| + |A m | > \G\ 
the proof is completed by the pigeonhole principle, |A| + \B\ > \G\ => A + B = G. □ 



The special case of Corollary 21 where all Aj are identical is due to Lev [3S], following a 
conjecture of Zemor [40J ■ Taking A, = D" l+1 x Z2 _m+1 for each i shows that the assumption on 
the cardinalities is sharp. 

4.1 An Isoperimetric Inequality 

We are inspired by Frankl's short inductive proof [13] of Harper's theorem [18] . 
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Theorem 22 (Harper's Inequality). Suppose ACZ^. 
If for 1 < k < n integer and < p < 1 real 



then 




In simple terms this theorem says that Hamming balls solve the vertex-isoperimetric problem in 
the hypercube. However, it also deals, to varying degrees depending on the version of the theorem, 
with sets of cardinalities strictly between I-D^LJ and \D^\. A stronger version would replace the last 
summand of each expression with (f) and respectively, where x G [k, n] is real. The optimal 

formulation due to Katona [22] and Kruskal [24] is stated in terms of the fc-cascade representations 
( a fc ) + \k-i) anc ^ (fe^"i) + vk-2) res P ec ti ve ly- Frankl's method yields all three formulations. 

Frankl's proof employs several useful operators on set-systems. As usual, we freely move between 
the set-theoretic terminology of 2^ n ' and the algebraic language of Z?? . The push-down operator Ti 
and the shift operator Sij have already appeared in Section [2] The upper and the lower shadow 
operators act on a set-system T C 2^ by 

ST = {JU{i} \JeT,i(£J} 
dT = {J\{i} \JeT,ieJ} 

respectively. For downsets, the notion of the shadow is close to that of the neighborhood in the 
theorem. If C C Z£ is a non-empty downset, then C + D'l = 8C U {0}. Note that always ^ SA. 
Another useful operation on set-systems is classification by n, denoted by: 

T- = {J \JeT,n(£J} 

T + = {J\{n} I J e T.n e J} 

When A C 7*2, we regard A + and A~ as subsets of . 

Following Frankl [13] . we proceed with two lemmas regarding properties of shifts and shadows. 

Lemma 23. Suppose C C Z§ is a shift-minimal downset. 

(1) 5{C+) C (5C)+ = C~ with equality iff C = 

(2) 8{c-) = (scy 

Proof. Examine the effect of the operators on the representation of some x £ C with the standard 
basis ei, e n . 

In both S(C + ) and (<5C) + , some is added and e„ is removed. However, in <5(C + ) certainly 
i =/= n since C + lives in ZJp , while in ((5C) + it is possible that i = n. Hence S(C + ) C (SC) + . 
By shift-minimality C is closed under these swaps, thus (<5C) + C C~ . Moreover, every element of 
C~ is obtained by adding e„ and then deleting it, so there is equality. However, d(C + ) is strictly 
smaller since € C~ \ <5(C + ) unless C is empty. 

For 5(C~) — (SC)~, note that both sets consist of elements of the form x + e; for x € C and 
i < n, where and e n do not appear in x's standard representation. □ 

The following lemma is well known. See e.g. [Tl] [5T]. Here we prove it as a special case of the 
compression machinery. 

Lemma 24. For all A C W 2 L and 1 < i, j < n such that i 7^ j, 

(1) S (Sij A) C S tf (SA) 

(2) d (S tJ A) C S y (flA) 
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Proof. By passing from A to — A, it is enough to prove only one of the inclusions. Denote 

A = Ul-=o ^fc wnere A k = A n \ -Dfc_i) • Note that we can work with each A k separately. One 
can write 

S [S tJ A k ) = (D? + dj (A k U \ Dl, 

and 

^ (M fc ) = dj (DJ + (A k U £>£_!)) \ 
yielding our claim by Lemma |6j since D™ + Cy(-B) = Cij(Df) + C i5 {B) C Cy + 5). □ 

Our isoperimetric inequality concerns a family of non-empty downsets Ci...C; C Z?? , rather than 
a single one. For the volume and the shadow we take the average quantities, denoted by: 

E[C] = j£|<7 m | E[6C] = jJ2\8(C m )} 

It is hard to make a meaningful statement about these average quantities without limiting the 
downsets somehow. To see this, consider what happens when each C m is either full or empty. 
We limit the variability of the downsets by assuming the antichain condition. Namely, we require 
that for each i and j, Ci \ Cj is an antichain with respect to set-systems inclusion, or equivalently 
C, Ddd. 

Proposition 25. Suppose C\...C\ C is a family of downsets which satisfies the antichain 
condition. If 

^ (»)+(") + -+G"iM: 

for some integer k > and real number < p < 1, then 

n\ I n\ (n\ in 



Since for non-empty downsets C + D" = {0} U 5C, the corresponding inequality in the language 
of neighborhoods is as follows. 

Corollary 26. In the setting of Proposition \25^ if C\...Ci are non-empty then 

E[0+Dris (») + (») + ... + (») + ^ :i 



Proof, (of Proposition 25 ) We may assume that the downsets are shift-minimal. Indeed, for each 
downset C m clearly SijC m is a downset of the same size, while \6 (SijC m )\ < \Sij (5C m )\ = \5C m \ by 
Lemma 24 If C m > \ C m is an antichain, then C m 3 dC m >, hence SijC m 3 Sij (dC m i) 2 d {SijC m i) 



by Lemma 24 again, and hence SijC m > \ SijC m is an antichain as well. In conclusion, SijC\...SijCi 
satisfy the antichain condition, E[C] = E[SV,C] and E[5C] > E[<5(S , y C)]. After a finite sequence of 
shifts the downsets are all shift-minimal, since for a proper shift ^ m h (SijC' m ) < Y] m ft (C m ). 

The case k = is established separately. Note that in this case E[C] < 1, hence C m — for 
some m. Actually, this is a sufficient condition for k = 0, because all other downsets are either 
or {0} by the antichain condition. Since S{0} — {ei, e n }, clearly E [5C] = n ■ E[C] as required. 

Following Frankl, we proceed by induction on n. By convention (?) = for n < k. Thus, for 
n = the lemma is vacuously satisfied by E [SO] > 0. 

For positive k and n, we employ the induction hypothesis on the families C-f .-.Cr and ...C z + 
in Z?? -1 . It is easily checked that given a downset C m , the sets and C~ are downsets as well. 
In addition, if C m < \ C m is an antichain, then so are its two parts, C~, \ C~ and Cw \ , hence 
the new families satisfy the antichain condition. 

By the induction hypothesis on ...C ( + C Z 2 l_1 , at least one of the following must hold: 



E [C+] < 



'i 

n-l\ fn-1 









'n 


u 


:i) 


+p\ 


y k 




(n — 






. + 


U- 


!)- 





2 J \k-lj V k 
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Use E[C~] = E[C] - E[C+] and Pascal's rule in the first case, or E [C~] > 1 +E[5(C+)] by 
Lemma 23 1) in the second one, to deduce: 



e [cr 



> 



n-l 




n — 1 
1 



n-l 
fc- 1 



n-l 

k 



Note that since fc > each G m is non-empty, so there is proper inclusion in the lemma, which yields 
the extra 1 in the calculation. By the induction hypothesis on ...Cf C : 



E[S(C-)] 



> 



n — 1 



n — 1 
2 



71 — 1 
fc 



n — 1 
fc + 1 



By Lemma 23 E[<JC] = E[ (SC) ] + E[ (<5G) + ] = E[5 (C~) ] +E[C~] , hence by Pascal's rule: 



E[SC]> I 



n 
fc + 1 



□ 



4.2 Proof of Lower Bound 

Proof, (of Theorem [T]) The general idea is similar to the case A + A discussed in the previous 
section. By applying various compressions, the sets A and B acquire certain structural properties. 
These, in turn, allow us to derive estimates on the cardinality of A + B. 

Lemma [6] asserts that compressions do not increase sumsets: \Ci(A) + Ci(B)\ < \A + B\ holds 
while |C/(A)| = \A\ and \Ci(B)\ = \B\. Thus, in the search for a lower bound for \A + B\, one 
can first apply a compression Cj on A and B simultaneously. Since (A) — G, we may suppose 
E = {0,ei,e2, ...,e„} C A and restrict ourselves only to compressions that preserve the inclusion 
E C A. By Lemma [BJ3) , if a compression Cj changes either A or B, then + fi(-B) strictly 
decreases. It follows that every sequence of such compressions must terminate. In conclusion, we 
can assume that both A and B are invariant under these compressions, or for short ((E C A))- 
compressed. This implies that B is /-compressed for every / C [n] such that A is /-compressed. 

Lemma [7] provides a description of A under this assumption. In particular, H C A C i? + E for 
some subgroup H = (0, ei, eh) ■ We next derive some structural properties of B. 

Lemma 27. Suppose A, B C G = Z2 are ((E 1 C A)) -compressed. Let H C A be as in Lemma^ 
Consider G/H = Z™ where m = n — h = codim H , with the basis {e^+i + H, e/,.+ m + //} and 
tte partial order of the corresponding set-system. For 1 <j < |//| Zei 



C,- = 



{//'eG/// |sn//'|>j} 



T/ien Gi...G|^f| are downsets, and satisfy the antichain condition. 

Proof. By Lemma[7]j3), A is {1, h,h + i}-compressed for 1 < i < m, and therefore so is B. 

Let H' -< H" be adjacent //-cosets in the partial order. H" — e^+i + H' for some 1 < i < m. 
Since B is {1, h,h + i}-compressed, B n (H' U //") must be an initial segment of H' U H". Note 
that all H' elements are lexicographically smaller than those of H" . Consequently, if B n H" 7^ 
then necessarily //' C B. In other words, H" € C% =>■ H' G G|#| for each such pair. 

In particular, Cj is a downset because H" £ Cj C Gi //' € G| ff | C G^ , and Gj \ G^ is an 
antichain since Cj \ C k C Gi \ G| H | 2 {H',H"}. □ 

We can conclude now the proof of Theorem [l] in the following three steps: 

1. We use the structure of the compressed sets to find new expressions for the cardinalities of B 
and A + B. Let Gi...G|#| be as in Lemma 27 By interchanging the order of summation: 



r 1 lHl 

\b\= J2 \BnH>\= \BnH'\>jj = ^|G,-| 

H'eG/H H'eG/H 3 = 1 

We estimate \A + B\ in a similar fashion. For 1 < j < \H\, suppose H" € S(Cj) U {//}. We 
show that A + B intersects H" in j elements at the least: 
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• If H" = H, use H C A and £ B f to obtain \(A + B) n H"\ > \(H + 0) f] H\ > j. 

• Otherwise H" — e/j+j + H' for some W £ Cj and 1 < i < m — codim H. Since 
e h+i £E£A, clearly \{A + B)n H"\ > \{e h+i + B) n (e ft+l + JT')I = l# n H'\ > j. 

Consequently: 



\ A + B \-- ^ 



# {j e N | + n H"\ >j} > £ l^(Ci) U 

H"EG/H j=l 



2. We use the isoperimetric inequality in order to obtain a lower bound on | A + B\ given m and 
\B\. Let < fc < m and w G [—1, 1] be such that: 

m = r +1 ) + r 1 +1 )+-+m+^(T) . IGI 

We substitute \B\ = J2 \Cj\ in the left-hand side, apply Pascal's rule to ( m + 1 )...( m + 1 ) on the 
right-hand side, and divide both by \H\ = \G\/2"\ to obtain: 

m\ I m\ I m \ 1 + w I m 



E ^-\H\^J c ^-\u\-\oj \ij ■ - u-i; 2 \i, 



Now by Proposition 25 



E[{7J}U(5C] > 1 



m\ / m\ / m\ 1 + w ( m 

1 + 2 + - + U + 1 



where the union is disjoint since always H SCj. In terms of A and B, this implies: 

u+m> m+m+^+m+o+^LTr) . IGI 

I I — 2 m +i 

3. What values can m = codim take? By Lemma ^7), (l + f ) /2 m > |A|/|G|, where the 
case m = 1 is separately deduced from the assumption |t4|/|G| < 3/4. On the other hand, 
by the theorem's assumption on t, \A\/\G\ > (t + 2)/2 t+1 = (l + |) /2*. Since the sequence 
(l + 5) /2™ is monotone, we infer m <t. 

The theorem is obtained by plugging m = t — 1 into the derived lower bound. We claim 
that for smaller m the bound is even higher, as demonstrated in Figure [3j Indeed, for each 
1 < % < t — 2, the graph of the lower bound on \A + B\ given m — i is concave down by 
the log-concavity of the binomial coefficients, (™) / > (^+1) / (™ ) ■ Thus the graph of 

TO = i + 1, which connects the midpoints of adjacent segments in the m — i graph, must be 
lower. 

□ 



Remark. By the Hamming balls construction, the lower bound we have found is optimal on a 
biparametric discrete family of points. In view of our treatment of F(K) in the previous section, 
we expect that at intermediate points better bounds should be provable. 

There are three points where our approach to Theorem [T] may be suboptimal: the isoperimetric 
inequality we use is not always perfectly tight, the addition of H U E instead of the whole of A, 
and dropping the assumption on i?'s affine span. 

It is perhaps worth remarking that the machinery of compressions can still be applied under 
the assumption (A) = (B) = G. This is done by showing that without loss of generality we may 
assume that A and B are simultaneously compressed such that they include a common affine basis. 

Here is a brief outline of how this is done. First, partition A and B into their intersections with 
cosets of ((A — A)D (B — B)). These parts can be translated without increasing \A + B\, such that 
A — A and B — B include a common basis of G. Then apply {i}-compressions with respect to this 
basis, until it is included in A n B. 
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